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VARIABLE-RESOLUTION  IMAGERY  FOR  FLIGHT  SIMULATION 


SUMMARY 

For  spatial  discrimination  tasks,  the  center  of  the  visual  field  is  more  sensitive  than  the 
visual  periphery.  This  observation  is  relevant  to  the  design  of  visual  simulators  in  that  it  suggests 
that  information  can  be  removed  in  the  peripheral  parts  of  a  simulator  image  without  affecting  the 
quality  of  the  simulation.  If  this  can  be  done,  then  smaller,  less  expensive  computer  systems  can 
be  used  to  store,  manipulate,  and  transmit  imagery  at  a  given  level  of  fidelity.  Although  there  are 
differences  in  sensitivity  across  the  visual  field,  in  most  cases  central  and  peripheral  sensitivity  can 
be  equated  if  the  peripheral  stimulus  is  appropriately  magnified.  Rather  than  applying  spatially 
varying  magnification,  we  processed  images  using  a  spatially  varying  operator  that  low>pass 
filtered  (i.e.,  blurred)  the  images  relatively  little  at  their  center  and  progressively  more  at  greater 
distances  from  the  center.  The  result  is  called  a  variable-resolution  image.  A  series  of  such 
images  processed  at  various  levels  was  used  to  estimate  the  blur  gradient  that  was  just  detectable 
by  human  observers.  Wide-field  (80®  diameter),  complex,  real-world  images  were  used  in  order 
to  approximate  the  imagery  used  in  visual  simulators.  It  was  possible  to  obtain  (from  the 
variable-resolution  imagery)  imagery  suitable  for  use  in  visual  simulation  since  the  processing 
techniques  described  here  are  consistent  with  a  formalism  which  allows  appropriately  filtered 
images  to  be  efBciently  sampled  and  hence  represented  with  less  information.  Based  on  the 
variable-resolution  functions  determined  to  be  near  threshold  for  our  observers,  we  can  generate 
effective  variable-resolution  \asual  imagery  using,  at  most,  one-quarter  of  the  information 
associated  with  conventional  imagery. 


GENERAL  INTRODUCTION 

The  research  described  here  was  designed  to  perceptually  evaluate  variable-resolution 
imagery  whose  spatial  fi*equency  content  decreases  as  a  function  of  distance  fi’om  its  center 
(Zeevi,  Porat,  &  Geri,  1990).  Such  imagery  is  potentially  relevant  to  the  design  of  visual 
simulators  in  that  it  can  be  madb  to  appear,  to  a  human  observer,  to  be  equivalent  to  an  image 
whose  spatial  fi-equency  content  is  equally  high  at  all  points.  Once  an  equivalent 
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variable-resolution  image  is  found,  formal  procedures  (Clark,  Palmer,  &  Lawrence,  1985;  Zeevi  & 
Shlomot,  1993)  exist  for  efficiently  sampling  the  image.  The  more  efficient  sampled  image  can 
then  be  used  in  place  of  the  original  image  in  the  visual  simulator. 

We  begin  here  with  a  brief  discussion  of  conventional,  position-invariant  image  processing 
techniques  in  order  to  distinguish  them  from  the  position-varying  techniques  used  in  the  present 
study.  We  discuss  some  of  the  characteristics  of  images  processed  by  our  technique,  as  well  as 
the  concept  of  local  bandwidth,  and  briefly  describe  a  formal  technique  for  appropriately  sampling 
non-uniform  imagery.  Following  this  general  introduction,  two  experiments  designed  to 
perceptually  evaluate  variable-resolution  imagery  are  described. 


Conventional  Image  Processing 

rtmventional  techniques  of  image  processing,  which  are  carried  out  in  the  context  of 
linear  systems  theory,  are  applicable  only  when  the  systems  used  to  operate  on  (or  process)  the 
image  are  linear  and  position-invariant.  Under  these  conditions,  it  is  possible  to  operate  on  an 
image  either  in  the  space  domain  (by  convolution)  or  in  the  frequency  domain  (by  filtering).  For 
example,  operations  in  the  space  domain  include  smoothing  an  image  in  order  to  improve  the 
signal-to-noise  ratio,  or  extracting  edges  in  order  to  perform  pattern  recognition  or  contour 
identification.  The  corresponding  operations  in  the  frequency  domain  would  be  low-pass  and 
band-pass  filtering,  respectively.  The  operation,  in  the  space  domain,  can  be  represented  formally 
as; 


lc(.x,y)  =  JJ  li(x,y)  ■  k(x-x,y-y')di  dy 

i  l,(x.y)»‘Hx.y)  (1) 

where  l/x',y')  and  I/x.y)  denote  the  input  and  output  images,  respectively,  k(x-x\y-y')  is  the 
convolution  kernel  which  represents  the  mask  used  to  operate  on  the  image,  and  the  double 
asterisk  (**)  denotes  a  two-dimensional  convolution.  In  the  spatial  frequency  domain,  the 
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corresponding  expression  is  obtained  by  Fourier  transforming  Equation  1  yielding; 


LoifSix,  ®>»)  —  (2) 

where  and  are  the  spatial  frequency  spectra  of  the  input  end  output  images,  respectively,  and 
K((o^(Oy)  is  the  modulation  transfer  function  (MTF)  of  the  system.  An  example  of  an  image 
generated  using  a  discrete  implementation  of  the  convolution  operation  of  Equation  1  is  shown 
on  the  right  in  Figure  1 .  The  original  unprocessed  image  is  shown  on  the  left.  The  convolution 
was  performed  with  a  39  x  39-pixel  gaussian  kernel.  The  summation  kernel  was 
position-invariant  and  thus  the  image  has  been  low-pass  filtered  (i.e.,  blurred)  equally  at  all 
positions. 

Variable-Resolution  Imagery 

In  order  to  process  images  in  a  position-varying  fashion  similar  to  that  taking  place  in  the 
visual  system,  we  cannot,  for  the  reasons  described  above,  apply  the  classical  techniques  of  linear 
systems  theory.  It  is  necessaiy,  therefore,  to  devise  techniques  which  are  analogous  to 
convolution  and  other  linear  operations,  but  which  are  not  position-invariant.  The  technique  we 
have  chosen  involves  operating  on  a  fixed-resolution  image  with  a  system  whose  characteristics 
vary  from  point  to  point.  Thus,  the  kernel,  k(x-x',y-y),  shown  in  Equation  1,  which  is  a  function 
only  of  the  differences  between  pairs  of  variables  along  the  two  spatial  coordinates,  becomes 
k(x,x\y,y'),  which  is  a  function  of  the  two  stimulus  independent  variables  and  the  two  response 
independent  variables.  Thus,  in  this  case: 

lo{x,y)  =  j|  li{x',y)  •  k(x,x\y,y')dx  dy  (3) 


In  accordance  with  Zeevi,  Peterfreund,  and  Shlomot  (1988),  we  will  refer  to  the  function  l/x,y)  as 
a  variable-resolution  image.  An  example  of  a  variable-resolution  image  is  shown  on  the  right  in 
Figure  2.  Again,  the  original  unprocessed  image  is  shown  on  the  left.  The  image  was  processed 
using  a  discrete  implementation  (see  Methods,  Experiment  1)  of  the  integral  operation  shown  in 
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Convolution 
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Figure  1 

Low-Pass  Filtering  Using  a  Position-Invariant  Gaussian  Kernel 


Variable  Resolution 
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Figure  2 

Variable  Resolution  Produced  Using  a  Position-Varying  Gaussian  Kernel 


Equation  3.  The  processing  was  performed  vnth  a  gaussian  kernel  whose  size  varied  from  1  x  1 
pixels,  at  the  center  of  the  image,  to  49  x  49  pixels  at  the  edge  of  the  image. 

Since  Equation  3  is  not  of  the  form  which  defines  a  convolution,  the  Fourier  transform 
cannot  be  applied  to  it,  and  thus  an  MTF  cannot  be  defined  for  the  system  (filter)  which  was  used 
to  operate  on  the  image.  Since  it  is  not  possible  to  condition  the  spectral  distribution  of  an  image 
by  multiplying  it  by  the  transfer  function  of  the  system,  it  is  always  necessary  to  operate  in  the 
space  domain  by  solving  the  integral  equation  which  represents  the  operation  of  a  filter  whose 
form  depends  on  position. 

Unlike  the  convolution  operation  of  Equation  1,  which  can  be  performed  only  in  the 
context  of  linear,  position-invariant  systems,  the  integral  operation  described  by  Equation  3  may 
transform  a  bandlimited  signal  into  a  signal  whose  bandwidth  is  not  limited.  In  the  case  of 
systems  which  are  linear  and  position-invariant,  this  cannot  occur  as  can  be  concluded  most  easily 
from  an  examination  of  the  operation  in  the  frequency  domain.  Since  the  convolution  integral  is 
transformed  into  the  product  of  the  spectrum  of  the  input  signal  and  that  of  the  MTF  of  the 
system,  it  is  clear  that  no  extra  frequency  components  can  be  introduced.  It  is  well  known, 
however,  that  nonlinear  systems  introduce  frequency  components  which  are  not  present  in  the 
input  signal.  Extra  frequency  components  or  bandwidth  expansion  can  also  result  from  a 
superposition  integral  of  the  type  shown  in  Equation  3,  which  represents  a  linear  but 
position-varying  operation. 

Bandwidth  expansion  of  an  input  signal  by  the  addition  of  frequency  components  is  more 
often  encountered  in  the  case  of  nonlinear  operations  (i.e.,  nonlinear  systems).  However,  this  type 
of  bandwidth  expansion  can  also  result  from  the  operation  of  a  system  which  is  linear  but 
position-varying.  Consider  a  signal  which  is  bandlimited  and  thus  satisfies  the  Nyquist  condition. 
If  the  signal  is  transformed  by  distorting  the  position  axis,  the  resulting  signal  no  longer  satisfies 
the  Nyquist  condition  in  that  the  signal  is  no  longer  bandlimited.  The  process  of  distortion  is 
analogous  in  a  way  to  FM  modulation  of  a  bandlimited  signal  which  introduces  extraneous 
frequency  components  (in  fact,  extends  the  band  to  infinity).  The  distorted  signal,  therefore, 
cannot  be  represented  by  a  discrete  set  of  samples. 
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Locally-Bandlimited  Imagery 

According  to  classical  sampling  theory  (Jerri,  1977),  a  signal  (or  image)  must  be 
convolved  with  a  sine  function  (low-pass  filtered)  prior  to  being  sampled  so  that  its  bandwidth  is 
appropriately  (inversely)  related  to  the  required  sampling  interval.  This  filtering  operation 
projects  the  image  into  a  space  of  bandlimited  functions  (BLFs)  so  that  it  can  be  represented 
using  a  finite  sampling  rate.  There  are  infinitely  numy  such  spaces,  denoted  B“,  each  defined  by 
its  associated  bandwidth,  O  (see  e  g.,  Zeevi  &  Shlomot,  1993). 

There  are  certain  conditions,  however,  under  which  an  image  can  be  represented  by  a 
finite  number  of  samples  even  if  it  is  not  bandlimited.  For  example,  if  we  had  begun  with  a 
bandlimited  image  and  applied  our  variable-resolution  procedure  to  it,  the  resulting  image  would 
no  longer  be  bandlimited.  However,  because  the  underlying  distortion  fiinction,  used  to  alter  the 
spatial  distribution  of  information  in  the  image,  is  of  a  form  for  which  an  inverse  exists  (cf ,  Clark, 
et  al,  1985),  that  inverse  can  be  used  to  restore  the  image  to  one  whose  information  is  distributed 
uniformly  and  which  is  again  bandlimited.  In  this  case,  the  generalized  sampling  procedure  can  be 
applied  and  the  resulting  variable-resolution  image  can  be  completely  represented  by  a  finite  set  of 
samples.  Images  which  are  not  bandlimited  but  can  be  converted  to  bandlimited  images  by  the 
application  of  the  inverse  of  a  distortion  function  are  said  to  belong  to  the  space  of 
locally-bandlimited  functions  (LBLFs).  This  space  is  denoted  by  B^,  where  O  is  now  the  local 

bandwidth  (see  below),  and  y  corresponds  to  the  associated  distortion  function. 

As  noted  earlier,  in  the  case  of  variable-resolution  images  the  integration  kernel  is 
spatially  varying  and  therefore  the  associated  integral  operation  (Equation  3)  is  not  a 
convolution.  Nevertheless,  assuming  that  the  integration  kernel  is  a  sinc-flmction,  this  operation 
projects  images  onto  a  well-defined  space  of  LBLFs  (Zeevi  &  Shlomot,  1993).  Analogous  with 
the  bandwidth  which  defines  each  space  of  BLFs,  there  is  a  measure,  called  local-bandwidth, 
which  defines  each  space  of  LBLFs.  The  concept  of  local  bandwidth  may  at  first  appear  to  be 
ill-defined  in  that  it  seems  to  be  self-contradictory  according  to  the  uncertainty  principle. 
According  to  this  principle,  in  order  to  have  a  limited  bandwidth,  the  function  has  to  be  of  infinite 
extent  and  the  information  regarding  the  bandwidth  (which  determines  in  turn  the  resolution  in 
frequency)  is  determined  in  relation  to  the  entire  extent  of  the  signal.  The  term  local  bandwidth 
implies  that  the  frequency  characteristics  are  derived  from  local  properties,  hence  the 
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contradiction.  It  is  therefore  important  to  elaborate  on  this  issue.  The  concept  of  local  bandwidth 
can  best  he  understood  in  the  context  of  signal  representation  in  the  combined  position-frequency 
space  where  a  different  bandwidth  can  be  specified  for  each  position  (within  the  limitations 
imposed  by  the  joint  uncertainty;  Gabor,  1946).  Thus,  one  interpretation  of  local  bandwidth  is 
that  the  local  properties  of  the  image  can  be  used  to  determine  a  transformation  of  it  which  wilt 
produce  a  globally  bandlimited  image  (i.e.,  one  that  belongs  to  B^.  Another  interpretation  of  the 
local  bandwidth  can  be  understood  in  the  context  of  a  position-varying  MTF  analogous  to  the  one 
introduced  by  Zadeh  (I9S2)  in  his  analysis  of  time-varying  systems.  In  analogy  to  the  formalism 
proposed  by  Zadeh,  the  MTF  is  a  function  of  both  position  and  frequency,  and  as  such,  one  can 
formally  define  the  local  bandwidth  for  a  given  position.  Similarly,  Horiuchi  (1968)  proposed  that 
if  a  signal  can  be  obtained  by  the  inverse  Fourier-like  operation  defined  as  follows. 

1  cmx) 

^  exp  (/(or)  cAo  (4) 

27C  J-kH^x) 

where  Ffx.atJ  is  a  position-dependent,  Fourier-like,  transform  of  /(xj,  then  the  signal  has  a 

position- varying  bandwidth  fV(xJ.  Zeevi  and  Shlomot  (1993)  extended  the  concept  of 
position-varying  bandwidth  by  showing  that  any  signal  which  belongs  to  is  also  characterized 
by  a  local  bandwidth. 

Once  an  analogy  is  drawn  between  uniform  and  nonuniform  operations,  the  concepts  of 
"local  bandwidth"  and  "B^"  (and  their  application  to  the  generation  of  variable  resolution 
imagery)  become  relatively  simple  and  straightforward.  It  should  be  stressed  that  the  proposed 
technique  for  projecting  a  given  image  into  B^  is  the  proper  way  of  dealing  with  the  issue  of 
variable  resolution  in  that  the  resultant  local  properties  are  well-defined,  and  in  that  such  an  image 
can  be  represented  by  a  (finite)  set  of  samples  and  reconstructed  without  any  aliasing  effects  even 
though  it  is  not  bandlimited  and  as  such  does  not  satisfy  the  Nyquist  condition.  Further,  such  a 
technique  permits  additional  nonuniform  processing  in  the  form  of  nonuniform  pyramids  and  other 
schemes  (Peterfreund  &  Zeevi,  1992;  Zeevi  er  a/.,  1988). 

Sampling  Nonuniform  Images 

If  an  image  has  been  transformed  (e  g.,  into  a  variable-resolution  image)  by  a  known 
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distortion  function,  there  are  in  principle  two  ways  by  which  it  can  still  be  represented  by  adiscrete 
set  of  samples.  First,  the  signal  can  be  restored  (undistorted)  and  then  represented  like  any  other 
bandlimited  signal.  Image  reconstruction  is  then  performed  in  accordance  with  the  provisions  of 
the  Whittaker-Shannon  sampling  theorem  using  an  interpolation  filter  in  the  form  of  a  sine 
function  (cf ,  Jerri,  1977).  Stated  anotho*  way,  if  an  image  can  be  projected  into  the  space  of 
BLFs  by  a  transformation  along  the  spatial  axis,  then  it  can  be  adequately  represented  by  a 
discrete  set  of  samples.  The  second  way  of  representing  a  transformed  image  by  a  discrete  set  of 
samples  is  to  distort  and  position  the  interpolation  functions  (sine  functions)  nonuniformly  in 
accordance  with  the  positional  distortion  function,  and  then  represent  the  signal  even  though  it  is 
not  bandlimited  and  as  such  does  not  satisfy  the  Nyquist  condition  (Clark  et  al,  1985;  Zeevi  & 
Shlomot,  1993). 

If  the  distortion  function,  as  described  above,  is  not  known,  or  if  it  is  not  known  whether 
the  image  even  belongs  to  the  space  of  LBLFs,  a  number  of  theoretical  and  practical  questions 
arise.  For  example,  if  it  is  not  known  what  type  of  nonuniform  processing  was  used  on  a  given 
bandlimited  image  to  generate  a  variable  resolution  image,  how  can  it  be  determined  whether  the 
latter  belongs  to  any  space  of  LBLFs  and,  if  so,  to  which  of  the  infinite  number  of  such  spaces? 
Theoretically,  in  order  to  determine  whether  an  image  belongs  to  a  particular  space  of  LBLFs, 
one  would  have  to  first  apply  the  nonuniform  filter  which  would  project  that  (or  any  other)  image 
into  that  space.  If  the  resultant  image  is  identical  to  the  original  one,  this  implies  that  the  image 
belonged  to  the  space  to  start  with,  and  that  the  space  itself  is  a  so-called  reproducing  kernel 
space  (Aronszajn,  1950).  Since  there  are  infinitely  many  such  spaces,  it  is  not  possible  of  course 
to  proceed  in  this  way.  The  alternative,  practical,  approach  is  to  devise  techniques  for  estimating 
a  distortion  function  which  optimally  matches  the  nonuniform  distribution  of  information  over  the 
image  field.  The  techniques  nught  be  based  on  estimating  statistically  the  rate  of  zero-crossings 
fi-om  a  number  of  images,  or  by  estimating  the  effective  local  bandwidth  which  characterizes  the 
representation  of  the  image  in  some  combined  position-frequency  space,  such  as  those  based  on 
the  Gabor  scheme,  the  Wigner  distribution,  or  the  complex  spectrogram  (cf ,  Zeevi  &  Shlomot, 
1993).  Estimation  of  the  optimal  distortion  function  will  determine  which  of  the  infinite  number 
of  locally  bandlimited  image  spaces  the  given  image  fits  best  in  the  sense  of  having  a  minimal 
distance  fi'om  the  original  image  according  to  some  criterion.  Several  techniques  have  been 
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proposed  by  Zeevi  and  Shlomot  (1993)  for  estimating  the  distortion  function  which  best 
represents  a  given  image  in  that  once  the  corresponding  projection  to  the  (well-defined)  space  of 
LBLFs  is  performed,  a  minimal  least-mean-square  error  results. 

In  the  context  of  the  present  study,  the  problems  are  somewhat  simpler  because  either  the 
exact  distortion  function,  corresponding  to  a  given  image,  is  known  or  the  type  of  nonuniform 
processing  applied  to  a  bandlimited  image  is  known  and,  as  such,  can  be  used  in  estimating  the 
distortion  function.  As  a  practical  matter,  we  need  to  determine  how  to  nonuniformly  distribute 
the  sampling  points  such  that  the  interpolated  variable  resolution  image  will  be  transformed  into 
an  LBLF.  Starting  with  a  standard  (uniform-resolution)  image,  how  can  one  devise  a  nonuniform 
filter  which  will  result  in  the  desired  distribution  of  resolution  across  the  image  while  at  the  same 
time  providing  a  formalism  which  satisfies  certain  theoretical  constraints  related  to  the  proper 
representation  of  the  image  by  a  set  of  sampling  points  and  the  reconstruction  of  the  image  from 
that  set  with  no  aliasing  effects? 

Sampling  Locally  Bandlimited  Images 

The  purpose  of  development  of  the  variable  resolution  technique  is  its  eventual  application 
in  efficient  image  representation  in  flight  simulators  (and  other  types  of  high-fidelity,  wide 
field-of-view  display  systems).  Thus,  for  the  purpose  of  image  storage  and/or  manipulation,  it  is 
desired  to  exploit  the  variable-resolution  properties  of  the  image  such  that  the  data  set  can  be 
reduced  accordingly.  In  fact,  this  is  the  very  reason  for  generating  variable  resolution  imagery.  In 
some  flight  simulators,  computer-generated  images  are  stored  and  manipulated  as  a  discrete  set  of 
points,  (i.e ,  a  two-dimensional  aay  of  samples).  The  question  then  is,  how  should  the  sampling 
points  be  nonuniformly  distributed  over  the  image  field  such  that  they  adequately  represent  the 
image  in  the  sense  that  the  variable  resolution  image  can  be  reconstructed  from  this  set  of 
sampling  points.  Intuitively,  it  is  clear  that  as  the  data  become  more  sparsely  distributed  as  a 
result  of  variable  resolution  processing,  the  required  density  of  sampling  points  decreases.  A 
sampling  theorem  for  images  which  belong  to  was  first  introduced  by  Clark,  et  al.  (1985). 

This  theorem  was  extended  by  Zeevi  and  Shlomot  (1993)  to  allow  images  to  be  sampled  using 

other  than  rectilinear  patterns.  As  stated  earlier,  Zeevi  and  Shlomot  also  showed  that  the  kernel 
which  projects  images  into  is  a  reproducing  kernel.  A  formalism  for  reconstructing  an  image. 
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f(x),  from  a  properly  chosen  set  {x„}  of  nonuniformly  distributed  samples  will  now  be  described. 

To  highlight  tiie  analogy  that  exists  between  uniform  and  nonuniform  sampling  schemes 
and  to  better  understand  the  latter,  we  consider  first  the  case  of  uniform  sampling.  Let  g(x) 
belong  to  the  space  (i.e.,  assume  that  the  area  of  support,  O,  defined  by  its  spectrum  is 
limited),  and  let  the  periodicity  matrix,  U,  represent  the  periodic  extension  of  the  spectrum,  G(a)), 
associated  with  the  sampling  operation.  Then,  g(x)  can  be  reconstructed  (interpolated)  from  its 
samples  along  the  uniform  grid,  x„  =  V  n,  [where  the  sampling  matrix,  V,  satisfies  the  condition 
that  the  inner  product  of  U  and  V  equals  a  constant  times  the  identity  matrix,  (i.e.,  ITV  =  2x1)], 
according  to  the  foUowing  formula; 

g(x)  =  |det  V|  2  -  Vn)  (5) 

n 

where  g(^J  represent  amplitudes  of  the  interpolation  function,  ^xj,  defined  by, 

^  ^  JJfj  «xp(/(i)^x)  f/ffl  (6) 

Thus  ^xj  represents  the  Fourier  transform  of  a  rectangular  pulse  of  unit  height  and  dimension  Q 
X  Q  (i.e.,  a  2-D  sine  function). 

We  now  extend  the  above  theorem  to  include  functions  which  belong  to  B^.  If  g(x)  is  a 
globally  BLF  (i.e.,  if  g(x)  e  B^),  then  f(x)  is  a  LBLF  (i.e.,y(*)  e  B^),  if  and  only  if,  a  distortion 
function,  y(x),  exists  such  that  f(x)  =  g[Y(x)]  (i.e.,  such  that  f(x)  can  be  expressed  as  a  function  of 
g,  and  hence  is  also  a  member  of  B”).  Note  that  if  y(x)  =  x,  then  the  two  above-described  spaces 
become  identical.  Such  an  LBLF  can  be  reconstructed  from  the  properly  distributed  set  of 
samples,  {x„},  where  x„  =  y  '(V  n),  according  to  the  following  formula: 

y(x)  =  MyWI  =  |det  Vj  2  g(Vn)  ^[y(x)  -  Vn 

n 

=  Idet  V|  2  WrCi)  -  Vn)  (7) 
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which  represents  interpolation  using  "uniformized”  samples.  Note  that  the  effect  of  y(x)  was  to 
change  the  distance  between  pulses  resulting  in  nonuniform  sampling  and  a  partial  overlap  of 
G( (o).  The  resulting  function  is  not  bandlimited  and  hence  abating  could  occur. 

The  implementation  of  the  above  formalism  can  be  better  understood  by  considering  the 
reconstruction  of  a  one-dimensional  signal  in  which  case  Equation  7  becomes; 


00 

2  y(x„)  sinc[‘Kx)-/i] 

Ifs-OO 


(8) 


where  sinc(x)  =  sin(7u:)/7cr  . 

Substituting  y(x)  =  ff^xj  x  into  Equation  8  yields; 


00 

Ax)=  2  tinc[FK*)  *-»] 

f|s-00 


(9) 


where  fF(xJ  provides  a  measure  of  the  sampling  density  and  of  the  local  bandwidth.  As  noted 
above,  if  jiicJ  =  x,  the  function /(3rj  belongs  to  B°.  Indeed,  in  that  case,  IV^xJ  =  1  and  we  have 
uniform  sampling  density,  and  Equation  9  becomes  the  standard  equation  of  mterpolation  from  a 
uniformly  distributed  set  of  samples. 


EXPERIMENT  1.  PERCEPTUAL  ASSESSMENT  OF 
WIDE-FIELD,  VARIABLE-RESOLUTION  IMAGERY 

INTRODUCTION 

The  well-documented  variations  in  spatial  discrimination  across  the  visual  field  (Levi, 
Klein,  &  Aitsebaomo,  1985;  Rovamo  &  Virsu,  1979;  Virsu,  NSsSnen,  &  Osmoviita,  1987) 
suggest  that  conventional,  position-invariant  processing  techniques  cannot  produce  images  which 
are  efficiently  matched  to  the  human  visual  system.  In  order  to  match  the  spatial  inhomogeneities 
of  the  human  visual  system,  images  must  be  processed  in  a  position-varying  manner.  This  would 
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be  necessary,  for  instance,  to  effectively  implement  high  resolution  over  a  wide  field  of  view  given 
limited  computational  and  bandwidth  resources  (Zeevi  et  aJ.,  1990).  Techniques  have  recently 
been  developed  (Peterframd  &  Zeevi,  1990;  1992;  Zeevi  &  Shlomot,  1993;  Zeevi  et  ai,  1988) 
for  filtering  images  by  position-varying  techniques  in  a  manner  analogous  to  the  low-pass  filtering 
performed  prior  to  sampling  with  fixed  sampling  rate  (i.e.,  position-invariant  sampling). 

As  noted  above,  the  information  processing  c^ability  of  the  human  visual  system 
decreases  from  the  center  of  the  visual  field  to  the  visual  periphery.  A  so-called  cortical 
magnification  factor  (CMF)  can  be  specified  which  relates  the  sensitivity  at  a  given  retinal 
eccentricity  to  that  at  the  fovea.  The  CMF  is  presumed  to  reflect  the  relatively  greater  number  of 
cortical  cells  associated  with  a  given  retinal  area  near  the  fovea  as  compared  to  an  equivalent  area 
in  the  periphery  (Daniel  &  Whitteridge,  1961;  Talbot  &  Marshall,  1941;  Van  Essen,  Newsome  & 
Maunsell,  1984).  Thus,  it  follows  that,  for  those  visual  tasks  that  can  be  scaled,  the  number  of 
cortical  cells  stimulated  can  be  equated  for  stimuli  in  the  center  and  periphery  if  the  latter  are 
appropriately  magnified. 

In  Experiment  1,  test  images  were  generated  using  a  position-varying  filter  whose 
bandvridth  decreased  as  a  function  of  distance  from  the  center  of  the  image.  The  specific 
functions  tested  were  chosen  to  approximate  the  observed  variation  in  the  CMF  across  the  visual 
field  (Rovamo  &  Virsu,  1979).  There  are  disagreements  in  the  literature  as  to  the  specific 
function  that  relates  CMF  to  eccentricity  (Dow,  Snyder,  Vautin  &  Bauer,  1981;  Levi  et  al,  1985; 
Rovamo  &  Raninen,  1984),  but  this  issue  is  not  addressed  here.  Rather,  several  candidate  linear 
functions  were  chosen  that  differed  in  central  blur  (at  8“  eccentricity),  peripheral  blur  (at  40° 
eccentricity),  and  the  blur  gradient  between  8°  and  40°  eccentricity.  The  function  that  produced  a 
blur  gradient  which  was  just  discriminable  from  an  unprocessed  image  was  taken  as  the  distortion 
funaion  which  may  in  turn  be  used  to  efficiently  sample  (i.e.,  subsample)  real-world  images  such 
as  those  used  in  visual  simulators.  Although  the  processing  performed  here  is  not  strictly  an 
image  magnification,  there  is  some  analogy  in  the  change  in  spatial  frequency  content  of  the 
images  resulting  fi-om  these  two  procedures.  Thus,  given  that  formal  procedures  have  been 
developed  for  sampling  and  generating  variable-resolution  imagery  (Clark  et  al,  1985;  Zeevi  et 
al ,  1988),  this  imagery  may  be  most  useful  for  producing  efficient  visual  simulation. 
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METHOD 


Generation  of  Variable-Resolution  Images 

A  locally  bandltmited  variable  resolution  image  can  be  produced  only  by  using  a  projection 
technique,  similar  to  that  described  by  Equation  3,  wherein  a  (variable)  sine  function  is  used  as  the 
integration  kernel.  Such  a  technique  is  computationally  intensive  and  relatively  difficult  to 
implement,  we  have,  therefore,  chosen  an  alternative  technique  which  uses  an  integration  kernel 
based  on  a  gaussian  function  whose  space  constant  varied  with  position  in  the  image.  Although 
this  approach  does  not  conform  to  any  complete  formalism,  which  might  otherwise  be  used  to 
fully  characterize  the  resulting  image  for  the  purposes  of  further  processing  and  representation,  it 
is  sufficient  for  obtaining  preliminary  psychophysical  data. 

Because  we  desire  the  final  variable-resolution  image  to  be  radially  symmetric  (i.e.,  to 
display  the  same  amount  of  degradation  at  all  points  equidistant  from  the  center),  it  is  convenient 
to  first  transform  the  ori^nal  image  from  cartesian  to  polar  coordinates  before  applying  the 
various  distortion  functions  that  determine  how  the  low-pass  characteristics  change  across  the 
image  space.  In  fact,  the  polar-coordinate  representation  is  inherently  appropriate  for  generating 
variable-resolution  imagery  in  that  it  allocates  a  larger  area  to  central  areas  of  the  cartesian 
version  of  the  image  than  to  peripheral  ones.  Also,  it  presents  the  image  data  points  in  an  ordered 
manner  that  makes  the  integration  (or  summation  in  the  discrete  case)  more  convenient.  After  the 
integration  operation  was  performed,  the  image  was  transformed  back  into  the  cartesian 
coordinate  system.  Variable-resolution  images  were  then  generated  using  a  discrete 
implementation  of  the  integral  operation  described  by  Equation  3  ; 

io(x,y)  =  22  hix'y)  k{x,y,x'y)  (10) 

m  n 

where  k  is  the  gaussian  summation  kernel  (see  Appendix  A).  The  width  of  the  kernel  varied  along 
both  the  radial  and  angular  coordinates  and  was  determined  by  sampling  a  function  of  the  form; 
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(11) 


=  exp 


jx-xf)  +  (y-/) 
D 


where  x'  and  y’  are  the  coordinates  of  the  image  pixel  over  which  the  summation  komel  was 
centered,  and  D  is  the  effective  width  (±lo)  of  the  gaussian.  The  summation  kernels  varied  in 
size  from  3  x  3  to  13  x  13  pixels.  Following  the  summation  operation,  the  image  was  transformed 

back  into  the  cartesian  coordinate  system.  A  transformation  from  cartesian  to  polar  coordinates 
and  back  may  result  in  approximation  errors  since  sampling  is  discrete  and  hence  not  all 
transformed  points  can  be  accurately  represented  in  both  coordinate  planes.  At  the  level  of 
resolution  used  in  the  present  study,  the  transformation  did  not  generate  any  noticeable  image 
degradation  which  might  have  been  mistaken  for  variable  resolution. 

The  linear  functions  shown  in  Figure  3  were  chosen  to  vary  the  parameters  which  specified 
the  integration  kernel  as  a  function  of  distance  from  the  center  of  the  image.  These  functions 
were  chosen  to  approximate,  to  varying  degrees,  the  cortical  magnification  functions  (CMFNs) 
derived  from  anatomical  and  psychophysical  data  (Dow  et  al,  1981;  Rovamo  &  Virsu,  1979). 
The  full  set  of  functions  shown  in  Figure  3  allows  an  assessment  of  central  blur,  peripheral  blur, 
and  blur  gradient,  although  all  of  these  factors  were  not  tested  independently  of  the  others.  Linear 
functions  were  considered  acceptable  since  they  give  a  good  approximation  to  published  CMFNs. 
For  instance,  between  8**  and  40°  eccentricity,  the  CMFN  suggested  by  Rovamo  and  Virsu  is 
nearly  linear  and  varies  from  7  to  9  (corresponding  here  to  our  kernel  size)  over  that  range. 

The  stimuli  used  in  the  present  study  extended  from  8°  to  40°  eccentricity.  The  central 
portion  of  the  stimulus  was  removed  for  two  reasons.  First,  the  CMFN  changes  rapidly  near  the 
fovea  and  the  available  display  resolution  was  not  high  enough  to  give  an  accurate  representation 
in  that  portion  of  the  visual  field.  Second,  it  was  expected  that  observers  would  find  it  difficult  to 
distribute  their  attention  over  a  stimulus  area  that  was  80°  in  diameter.  This  would  have  been 
especially  difficult  if  they  were  asked  also  to  attend  to  visual  detail  presented  at  or  near  the 
fixation  point. 

Observers 

The  observers  were  two  males  (SF  and  GG)  and  one  female  (KV)  who  were  25, 41,  and 
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Figure  3 

Variable-Resolution  Functions  Tested  in  Exi 


24  years  of  age,  respectively.  All  observers  had  normal  uncoirected  vision.  Observers  SF  and 
KV  were  paid  for  their  participation,  while  observer  GG  was  one  of  the  authors. 

Stimuli  and  Apparatus 

Two  real-world  aerial  photographs  were  di^tized,  and  four  stimulus  templates  were 
constructed  by  pairing  each  half  of  the  two  photographs  ^th  its  mirror  image  and  removing  a 
portion  of  the  center  of  the  image.  One-half  of  each  stimulus  was  then  low-pass  filtered  (i.e., 
blurred)  in  accordance  with  one  of  the  functions  of  Figure  3.  Two  of  the  stimulus  templates, 
corresponding  to  one  blur  gradient  (much  higher  than  those  used  in  the  present  study)  applied  to 
the  left  half  of  each  image,  are  shown  in  Figure  4.  The  programs  used  to  produce  the  stimulus 
templates  for  both  Experiments  1  and  2  are  presented  in  Appendix  B.  The  full  stimulus  set 
consisted  of  108  images.  Of  these,  84  were  test  stimuli  corresponding  to  the  four  image  templates 
each  processed  at  one  of  the  21  filter  gradients.  The  remaining  24  stimuli  were  control  stimuli 
consisting  of  each  image  template  processed  on  both  sides  by  one  of  seven  (i.e.,  1  x  1  through  13 
X  13)  kernels.  An  equal  number  of  test  and  control  stimuli  were  presented  in  each  session.  The 
functions  in  only  one  panel  of  Figure  3  were  tested  in  a  given  session,  which  included  presentation 
of  all  four  image  templates.  In  a  given  session,  the  test  stimuli  were  presented  with  the 
unprocessed  image  appearing  on  the  right  10  times  and  on  the  left  10  times.  Each  control 
stimulus  was  accordingly  presented  20  times  per  session  since  no  distinction  could  be  made 
between  the  left  and  right  halves  of  these  stimuli.  Between  two  and  four  sessions  were  run  for 
each  observer  under  the  four  conditions  corresponding  to  panels  of  Figure  3.  All  stimuli  were 
presented  Im  fi’om  the  observer  and  they  extended  radially  for  40°  fi-om  a  fixation  point  located  at 
the  center  of  the  image.  Stimulus  duration  was  167  msec.  The  observer  was  seated  and  used  a 
chin  and  head  rest  to  maintain  a  constant  position  relative  to  the  screen. 

All  image  presentation  and  data  collection  were  under  the  control  of  a  Silicon  Graphics 
Iris  workstation.  The  stimuli  were  presented  using  the  green  channel  of  a  Sony  Superdata  CRT 
projector  (Model  VPH-1270Q),  and  a  rear-projection  screen  (Lumiglass  350,  Stewart  Film 
Screen  Corp.).  Subject  responses  were  made  using  a  mouse  interfaced  with  the  computer. 
Display  luminance  was  measured  using  a  Photo-Research  Model  1500  photometer.  The 
gamma-fimction  for  the  green  CRT  channel  was  measured  and  linearized  using  an  8-bit  look-up 
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Figure  4 

Two  of  the  Image  Templates  Used  in  Experiment  1 


table.  In  addition,  a  luminance  measurement  was  obUuned  at  24  equally  spaced  points  between 
the  center  and  edge  of  the  display  area  in  order  to  correct  for  the  luminance  gradient  caused  by 
the  projector  optics  and  the  directional  properties  of  the  rear-projection  screen.  The  lumiiumce 
gradient  was  corrected  off-line  by  appropriately  adjusting  the  8-bit  gray  scale  of  each  stimulus 
image.  The  mean  luminance  of  the  stimuli  was  O.S  fL. 

A  set  of  MTFs  for  the  display  system  was  calculated  from  horizontal  and  vertical 
line-spread  functions  associated  with  lines  of  lighted  pixels  located  at  the  center  of  the  display  and 
at  points  12°  and  28°  from  the  center  in  both  directions  along  the  horizontal  meridian. 
Measurements  were  made  using  a  Photo-Research  (Model  PR-719)  Spatial  Scanner,  and  software 
(see  Appendix  C)  that  implemented  the  procedures  for  calculating  MTFs  described  by  Kelly 
(1992).  The  horizontal  and  vertical  MTFs  calculated  for  the  display  system  are  shown  in  Figures 
Sa  and  5b,  respectively.  The  horizontal  MTFs  (each  calculated  from  a  vertical  line  of  pixels)  were 
virtually  identical  at  all  points  tested  across  the  screen.  There  were,  however,  differences  in  the 
vertical  MTFs  for  those  same  points.  As  a  result,  a  residual  blur  gradient  was  effectively  added  to 
all  horizontally  oriented  stimulus  features.  However,  as  the  data  of  Figure  Sb  show,  this  gradient 
was  relatively  small  and  in  a  direction  opposite  to  that  of  the  gradients  evaluated  in  the  present 
study.  Further,  in  the  present  study,  an  unprocessed  image  was  presented  on  all  trials,  and 
because  both  the  horizontal  and  vertical  MTFs  measured  at  a  given  distance  from  the  center  of  the 
image  were  similar,  no  differential  response  would  be  expected  based  on  the  measured  differences 
in  display  MTF.  The  data  presented  here  should,  however,  be  taken  as  conservative  estimates  of 
the  observers'  sensitivity  to  image  blur  since  higher  sensitivity  would  be  expected  had  the  residual 
blur  been  removed. 

Procedure 

Each  experimental  session  began  with  a  S-10  min  period  of  adaptation  to  the  low  ambient 
illumination  of  the  experimental  room.  The  observers  then  viewed,  for  2-3  min,  the  center  of  an 
illuminated  screen,  which  was  the  same  size  and  had  the  same  mean  luminance  as  the  experimental 
stimuli.  The  observer  initiated  the  experimental  session  using  a  mouse,  and  the  first  stimulus 
appeared  within  3-4  sec.  The  observers  were  asked  to  respond  as  to  which  half  of  the  image 
appeared  to  be  more  blurred.  There  were  3-4  sec  between  the  observer's  response  and  the  next 
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Figure  Sa 

Horizontal  MTFs  of  the  Stimulus  Display  System 
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Figure  5b 

Vertical  MTFs  of  the  Stimulus  Display  Syston 


stimulus.  The  experimental  session  consisted  of  between  320  and  480  stimulus  presentations 
broken  up  into  three  subsessions  each  consisting  of  as  close  to  one-third  of  the  trials  as  possible. 
The  observers  controlled  the  amount  of  time  between  each  subsession  and  could  also  initiate  rest 
periods  within  the  subsessions  by  delaying  their  response.  A  mean-luminance  screen  was  present 
between  stimuli  and  during  all  rest  periods. 


RESULTS 

The  discrimination  data  obtained  from  the  three  observers  are  summarized  in  Figures  6a, 
6b,  and  6c,  respectively.  The  data  for  both  the  bomber  and  city  images  were  similar  and  so  all 
data  were  pooled  for  each  observer.  The  four  plots  associated  with  each  observer  correspond  to 
the  four  sets  of  variable-resolution  functions  tested.  Based  on  the  percentage  of  correct  responses 
(%C),  three  levels  were  used  to  categorize  the  observers'  ability  to  discriminate  an  unprocessed 
image  from  an  image  processed  according  to  each  of  the  functions  within  each  set.  The 
dark-shaded  areas  of  Figure  6  encompass  the  variable-resolution  functions  that  were  discriminated 
more  than  85%  of  the  time,  while  the  white  areas  encompass  the  functions  that  were  discriminated 
at  near  the  chance  level  (<65%C).  The  functions  discriminated  between  65%  and  85%  of  the  time 
were  defined  as  being  at  or  near  threshold,  and  are  identified  by  the  gray-shaded  areas  in  the 
figure. 

The  data  of  Figure  6  show  that  observer  SF  was  able  to  discriminate  the  low-pass  filtered 
images  somewhat  better  than  observer  GG  who,  in  turn,  was  somewhat  more  sensitive  than 
observer  KV.  Despite  these  quantitative  differences,  all  observers  showed  qualitatively  similar 
patterns  of  sensitivity  across  the  four  sets  of  stimuli  tested.  For  instance,  whereas  discrimination 
performance  declined  as  the  size  of  the  integration  kernel  applied  to  the  peripheral  edge  of  the 
image  was  decreased,  the  decline  could  be  compensated  by  a  (generally  smaller)  increase  in  the 
size  of  the  kernel  used  at  the  more  central  edge  of  the  image. 

DISCUSSION 

The  second  function  from  the  top  in  the  third  panel  of  Figure  3  most  closely  approximates 
the  human  CMFN  suggested  by  Rovamo  and  Virsu  (1979).  When  the  data  for  all  three  observers 
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Figure  6b 

Blur  Discrimination  Data  for  Observer  GG 
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ECCENTRICITY  (deg) 

Figure  6c 


are  taken  together,  this  function  is  at  or  near  threshold  as  defined  in  the  present  study.  Of  course, 
many  of  the  other  fiinctions  are  also  at  or  near  threshold  by  that  definition.  It  was  not  the  purpose 
of  the  present  study  to  estimate  the  CMFN,  but  rather  to  use  the  CMFN  as  an  initial  estimate  of 
the  threshold  blur  gradient  for  wide-held  imagery. 

The  slope  of  the  CMFN  that  produced  a  minimally  discriminable  image  in  the  present 
study  was  very  similar  to  the  slopes  estimated  from  luminance  and  contrast  sensitivity  data 
(Johnston,  1987;  Rovamo,  1983;  Rovamo,  Virsu,  &  Nas^nen,1978).  This  similarity  at  least 
suggests  that  the  discrimination  of  blurred  images  may  be  dependent  on  image  properties  defined 
by  such  fundamental  measures  as  component  spatial  frequency  and  orientation.  Further,  similar 
data  have  been  obtained  here  for  visually  diverse,  complex  images,  suggesting  that  the  visual 
equivalence  of  variable-resolution  images  is  independent  of  local  variations  in  image  detail. 

The  data  of  Figure  6  will  be  used  below  (see  General  Discussion)  to  estimate  the  minimal 
sampling  rate  required  to  adequately  represent  typical  real-world  imagery.  In  the  context  of 
image  representation  schemes  recently  reported  (Ebrahimi  &  Kunt,  1991;  Porat  &  Zeevi,  1988), 
these  data  may  be  used  to  generate  efficient  imagery  that  consists  of  only  those  spectral 
components  to  which  each  region  of  the  visual  field  is  sensitive  (cf.  Zeevi  et  ai,  1990). 


EXPERIMENT  2:  BLUR  DISCRIMINATION  AS  A  FUNCTION  OF  IMAGE 
CONFIGURATION  AT  15°  AND  30°  ECCENTRICITY 

INTRODUCTION 

The  data  of  Experiment  1  suggest  that  wide-field  images  processed  using  position-varying 
filters  can  appear  preattentively  similar  to  an  unprocessed  image  for  certain  processing  gradients. 
Furthermore,  processing  near  the  center  of  the  image  can  be  effectively  exchanged  for  processing 
in  the  periphery  in  such  a  way  as  to  maintain  the  perceptual  equivalence  of  the  image  and  its 
unprocessed  counterpart.  In  Experiment  2,  sensitivity  to  image  blur  was  measured  for  circular 
and  radial-segment  apertures  of  various  sizes.  Data  were  obtained  at  15°  and  30°  eccentricity  in 
order  to  determine  whether  there  are  differences  in  the  spatial  properties  of  the  visual  mechanisms 
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underlying  the  detection  of  image  blur  at  these  two  eccentricities. 


METHOD 

Observers 

Data  were  obtained  from  three  observers,  one  of  whom  (GG)  participated  in  Experiment 
1 .  Observers  CT  and  DW  were  males,  20  and  26  years  of  age,  respectively.  Each  had  normal 
uncorrected  vision.  Observers  CT  and  DW  were  unaware  of  the  purpose  of  the  study,  and  each 
was  paid  to  participate  in  this  experiment. 

Stimuli  and  Apparatus 

Three  images  were  used  as  stimuli  in  this  experiment.  Two  of  the  images  were  those  used 
in  Experiment  1,  and  the  third  was  a  black-and-white,  one  cycle/degree,  radial  square  wave.  The 
stimuli  were  chosen  to  represent  three  levels  of  spatial  homogeneity  (bombers,  lowest;  square 
wave,  highest).  The  1024  x  1024  x  8-bit  images  were  processed  by  convolution  (Equation  1) 
with  gaussian  kernels  again  varying  in  size  from  3x3tol3xl3  pixels.  The  test  stimuli  were 
produced  by  applying  either  a  circular  or  radial-segment  aperture  (see  Figure  7)  on  both  sides  of 
the  fixation  point  of  both  the  oiiginal  and  processed  images.  The  apertures  were  centered  at 
either  15°  or  30°  along  the  horizontal  meridian.  The  sets  (circular  and  radial-segment)  of  stimuli 
presented  at  15°  eccentricity  had  areas  of  8.4,  16.8,  25.2,  33.5,  and  41.9  deg^.  The  sets  presented 
at  30°  eccentricity  had  areas  of  16.8,  33.5,  50.3,  67.0,  and  83.8  deg^  (i.e.,  corresponding  to  a 
magnification  factor  of  two  relative  to  the  stimuli  presented  at  15°).  The  display  system  was 
identical  to  that  used  in  Experiment  1 . 

Procedure 

The  testing  procedure  was  identical  to  that  of  Experiment  1 .  In  a  given  experimental 
session,  all  three  stimuli  and  all  five  stimulus  areas  were  tested  (30  repetitions  for  each  of  the  15 
conditions)  for  one  eccentricity,  one  stimulus  configuration,  and  one  level  of  processing  (i.e., 
blurring).  Three  randomized  blocks  of  trials  were  run  in  each  session,  with  the  three  stimulus 
types  processed  at  one  kernel  size  determining  the  block.  For  each  of  the  three  stimulus  types,  the 
five  stimulus  areas  were  tested  in  a  random  order  as  a  sub-block.  Each  experimental  session 
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Figure  7 

The  Circular  and  Radial-Segment  Apertures  Used  to 
Define  the  Stimuli  Used  in  Experiment  2 
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consisted  of 450  trials,  and  required  about  45  min  to  complete.  For  each  observer,  betweoi  three 
and  five  experimental  sessions  were  run  for  each  combination  of  eccentricity,  stumihis 
configuration,  and  kernel  size. 


RESULTS 

Shown  in  Figure  8  is  a  typical  set  of  response  fimctions  used  to  estimate  blur 
discrimination  thresholds.  The  data  of  Figure  8  wo’e  obtained  at  15°  eccentricity  for  the  lowest 
level  of  stimulus  homogeneity  (i.e.,  HI,  the  bombers)  and  represent  an  average  over  ail  three 
observers.  The  two  sets  of  data  in  each  pand  correspond  to  dther  the  circular  (filled  circles)  or 
radial-segments  stimuli,  and  the  five  panels  correspond  to  the  five  stimulus  areas  tested.  The  data 
points  in  each  panel  show  the  proportion  of  correct  responses  as  a  fimction  of  the  size  of  the 
summation  kernel  used  to  process  the  stimuli.  As  the  summation  kernel  increases  in  dze,  the 
amount  of  blur  increases  and  the  proportion  of  trials  for  which  the  processed  stimulus  is 
discriminated  fi’om  its  unprocessed  counterpart  also  increases.  The  fimctions  placed  through  the 
data  in  each  panel  of  Figure  8  represent  the  best  fitting  WeibuU  distribution  of  the  form: 

P=l-0.5exp[-(^)'j  (12) 


where  P  is  the  proportion  of  correct  responses,  K  is  the  kernel  size,  K,  is  the  threshold  kernel  size 
(corresponding  to  P  =  0.816),  and  s  determines  the  steepness  of  the  fimction.  The  function  of 
Equation  12  was  fitted  to  the  data  using  the  SigmaPlot  Scientific  Graph  System  (Jandel 
Scientific).  The  horizontal  dotted  line  in  the  figure  indicates  a  proportion  correct  of  0.816,  which 
was  taken  as  the  threshold  response  level. 

Blur  thresholds,  estimated  fi-om  response  functions  like  those  of  Figure  8,  are  plotted  in 
Figure  9  as  a  function  of  stimulus  area  for  both  of  the  eccentricities  tested  (15°  and  30°)  and  for  all 
three  levels  of  stimulus  homogeneity  (H1'H3).  The  functions  obtained  for  both  the  circular  and 
radial-segment  stimuli  show  a  general  decrease  in  blur  threshold  as  stimulus  area  is  increased, 
although  the  decrease  was  generally  greater  for  the  data  obtmned  at  15°.  There  appears  also  to  be 
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a  difference  between  the  two  eccentricities  tested  in  the  relative  decrease  in  blur  threshold  for  the 
two  stimulus  configurations.  In  order  to  test  this  difference,  t-tests  were  performed  on  each  of 
the  differences  between  the  two  curves  in  each  of  the  panels  of  Figure  9.  Because  five  /-tests 
were  performed  for  each  eccentricity/homogeneity  condition,  the  level  of  significance  for  the  tests 
described  here  was  reduced  to  />=0.01.  The  asterisks  placed  along  the  upper  axis  in  each  panel 
indicate  the  stimulus  areas  for  which  the  blur  thresholds  for  the  circle  and  radial-segment  stimuli 
were  significantly  different  at  this  level.  The  computed  /-values  and  degrees  of  fi'eedom  for  each 
of  the  significant  differences  indicated  by  asterisks  in  the  figure  were  as  follows:  /d^„=  3.92, 
/K0.002  (Hl/33.5),-  /  ,=  5  28,  /KO.OOl  (Hl/41.9);  /„=  3.23,  /KO.Ol  (H2/25.2);  3.06, 

/KO.Ol  (H2/41.9);  and  /„=  3.16,  /KO.Ol  (H3/25.2).  As  can  be  seen  in  Figure  7,  the  smallest 
circle  and  radial-segment  stimuli  are  very  similar  in  shape,  and  so  little  or  no  difference  between 
the  stimulus  configurations  would  be  expected  for  those  areas.  While  only  five  of  the  thirty 
differences  tested  were  statistically  significant,  all  of  the  significant  differences  were  found  among 
the  three  largest  areas  at  15“  eccentricity. 

Shown  in  Figure  10  is  the  difference  in  blur  threshold  between  the  15“  and  30“  eccentricity 
conditions  for  both  the  circle  and  radial-segment  stimuli  and  for  the  only  two  stimulus  areas  (16.8 
and  33.5  deg^  tested  at  both  eccentricities.  For  both  stimulus  areas,  the  difference  in  blur 
threshold  between  the  two  eccentricities  tested  was  significantly  different  from  zero  for  the  circles 
(16.8;/,,=  1.00,  /KO.Ol;  33. 8;/, 4=  4.04,  /K0.002)  but  not  for  the  radial  segments  (16.8:/,,=  0.42, 
p>0.50, 33.8./,,=  0.14,p>0.80). 

DISCUSSION 

The  data  of  Figure  9  show  that  there  is  a  general  decrease  in  blur  discrimination  threshold 
with  increases  in  stimulus  size.  This  is  true  for  both  the  circular  and  segmental  stimuli  presented 
at  both  15“  and  30“  eccentricity.  There  is  also  an  indication  in  some  of  the  functions  that  blur 
threshold  has  reached  an  asymptote  and  is  even  increasing  for  the  largest  stimulus  areas.  This 
effect  suggests  a  functional  limit  to  the  size  of  the  excitatory  portion  of  the  perceptu^  receptive 
field  (Westheimer,  1967).  The  fact  that  the  increase  in  threshold  is  more  obvious  for  the 
segmental  stimuli  further  suggests  that  the  perceptual  receptive  field  for  these  stimuli  may  be 
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Figure  10 

Difference  in  Blur  Thresholds  Between  15*  and  30“  Eccentricity 


organized  differently  than  for  circular  stimuli.  These  possibilities  will  be  discussed  further  below. 

The  decrease  in  blur  threshold  with  increases  in  stimulus  area  is  generally  greater  for  the 
stimuli  presented  at  1 5°  suggesting  that  the  effective  magnification  factor  of  two  applied  to  the  30° 
stimuli  did  not  fully  compensate  for  the  effect  of  stimulus  area  It  should  be  noted,  however,  that 
the  difference  in  the  threshold  change  is  larger  for  the  1 5°  stimuli  due  mainly  to  the  relatively  high 
threshold  at  the  smallest  stimulus  size.  Given  that  real-world  stimuli  were  used  for  homogeneity 
levels  HI  and  H2,  it  is  possible  that  this  high  threshold  is  related  to  the  chance  absence  within  the 
smallest  aperture  of  those  stimulus  features  that  are  most  easily  discriminated  under  the  conditions 
of  the  preser.t  study.  This  interpretation  is  supported  by  the  generally  lower  thresholds  at  all 
stimulus  sizes  for  the  square-wave  stimuli,  which  were  most  spatially  homogeneous.  It  is 
concluded,  therefore,  that  a  CMF  of  two  adequately  equates  the  blur  discrimination  data  obtained 
at  1 5"  and  30°  eccentricity. 

The  difference  in  blur  threshold  obtained  using  circular  and  segmental  stimuli  is  more 
obvious  and  only  statistically  significant  at  15°  eccentricity.  Although  some  of  the  differences  at 
30°  were  relatively  large,  the  variability  at  that  eccentricity  was  also  higher  thus  rendering  the 
differences  statistically  insignificant.  This  is  not  an  explanation  for  the  relevant  differences 
between  15°  and  30°  eccentricity,  however,  since  the  divergence  at  high  stimulus  areas,  of  the 
functions  corresponding  to  the  circular  and  segmental  stimuli,  is  consistent  only  at  15°  for  the 
homogeneity  levels  tested.  Thus,  the  data  of  Figure  9  suggest  that  at  15°  eccentricity  (and 
presumably  at  lesser  eccentricities),  the  visual  information  required  to  discriminate  image  blur  is 
not  summated  within  a  constant  eccentricity.  Rather,  the  perceptual  summation  area  is  more 
nearly  circularly  symmetrical.  On  the  other  hand,  perceptual  summation  at  30°  extends  over  a 
relatively  large  area  and  is  generally  unaffected  by  the  spatial  distribution  of  the  stimulus. 

As  noted  earlier,  two  of  the  areas  tested  at  1 5°  eccentricity  correspond  to  areas  tested  also 
at  30°  eccentricity.  As  can  be  seen  in  the  data  of  Figure  10,  the  difference  in  blur  threshold 
between  15°  and  30°  eccentricity  for  these  two  areas  is  much  larger  for  the  circular  stimuli  than  for 
the  segmental  stimuli.  It  was  concluded  above,  based  on  the  data  from  all  stimulus  areas  tested, 
that  a  CMF  of  two  was  sufficient  to  equate  the  blur  discrimination  data  obtained  at  15°  and  30°. 
That  conclusion  is  consistent  only  with  the  circular  data  of  Figure  10.  There  appears,  however,  to 
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be  no  significant  difference  in  the  blur  thresholds  obtuned  at  the  two  eccentricities  when 
segmental  stimuli  are  used.  This  is  further  evidence  of  a  difiTerence  in  the  spatial  properties  of 
perceptual  fields  at  different  eccentricities. 


GENERAL  DISCUSSION 

In  the  case  of  the  variable-resolution  images  used  in  the  present  study,  the  distortion 
function  is  derived  from  physiological  and  psychophysical  data  that  describe  how  visual 
processing  capability  decreases  with  eccentricity.  Since  the  distortion  function  is  known,  there  are 
two  ways  of  representing  the  variable-resolution  image  by  a  discrete  set  of  samples  without 
generating  artifacts.  First,  the  image  can  be  transformed  such  that  it  can  be  properly  represented 
by  a  uniform  set  of  samples.  This  is  done  by  using  the  inverse  of  the  distortion  function  to 
produce  a  bandlimited  image  (without  loss  of  information)  and  then  sampling  the  image  uniformly 
in  accordance  with  the  standard  Nyquist  criterion  (Jerri,  1977).  Stated  another  way,  if  an  image 
can  be  bandlimited  by  a  transformation  which  appropriately  distorts  the  spatial  axis,  then  it  can  be 
adequately  represented  by  a  discrete  set  of  samples.  The  second  way  of  representing  a 
variable-resolution  image  by  a  discrete  set  of  samples  is  to  sample  it  nonuniformly  at  a  rate 
determined  by  the  form  of  the  distortion  function.  The  image  can  then  be  reconstructed  using 
interpolation  functions  (i.e.,  sine  functions  distorted  in  accordance  with  the  nonuniform  spatial 
distortion  function).  Thus,  in  this  case,  the  variable-resolution  image  can  be  represented  by  a 
finite  set  of  sampling  points  even  though  it  is  not  bandlimited  and  as  such  does  not  satisfy  the 
Nyquist  condition. 

In  the  context  of  the  present  study  and  the  applications  discussed  earlier,  we  start  from  a 
uniform-resolution  image  and  generate  a  variable-resolution  image.  The  operation  performed  in 
this  case  can  be  viewed  as  filtering  which  is  analogous  to  the  bandpass  filter  applied  prior  to 
uniformly  sampling  an  image.  The  low-pass  filtering  is  performed  in  our  case  by  convolving  the 
image  with  a  gaussian.  This  operation  does  not  satisfy  the  requirement  of  projecting  an  image 
onto  the  space  of  BLFs  in  that  the  associated  measure  is  an  effective  bandwidth  (Gabor,  1946) 
and  hence  is  only  approximate.  [For  examples  of  variable-resolution  images  generated  by  filtering 
with  a  nonuniform  sine  function,  see  Zeevi  et  al.  (1988)  and  Peterfreund  and  Zeevi  (1992)].  In 
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the  case  of  nonunifonn  sampling,  the  filter  applied  prior  to  san^ling  is  nonuniform.  In  other 
words,  its  properties  (corr^ponding  bandwidth)  vary  as  a  function  of  position  in  accordance  with 
visual  requirements. 

Efficient  Sampling  of  Variable-Resolution  Imagery 

Numerous  image  processing  and  compression  techniques  have  been  developed  for 
representing  visual  images  efficiently.  These  are,  in  general,  based  on  the  redundancy  in  natural 
images  and  the  2-D  spatial  fi-equency  selectivity  of  the  human  visual  system  (Baddeley  & 
Hancock,  1991;  Field,  1987;  Reed,  Ebrahimi,  Marques,  &  Kunt,  1991).  Variable-resolution 
imagery  takes  advantage  of  another  salient  property  of  the  human  visual  system— its  spatial 
inhomogeneity.  One  practical  advantage  of  variable-resolution  imagery,  as  defined  here  (see 
Equation  2),  is  that  it  can  be  adequately  (by  a  visual  criterion)  sampled  by  fewer  points  than  its 
fixed-resolution  counterparts.  The  variable-resolution  technique  described  here  was  developed  to 
efficiently  represent  images  used  in  high-fidelity,  wide  field-of-view  displays  (Geri,  Zeevi,  & 
Porat,  1990).  Thus,  for  the  purposes  of  image  storage,  manipulation,  and  transmission,  it  is 
desired  to  exploit  the  variable-resolution  properties  of  the  image  such  that  the  data  set  can  be 
reduced  accordingly.  The  question  then  is,  how  should  the  sampling  points  be  nonuniformly 
distributed  over  the  image  field  such  that  they  adequately  represent  the  image  in  the  sense  that  the 
variable-resolution  image  can  be  reconstructed  from  this  set  of  sampling  points?  Intuitively,  it  is 
clear  that  as  the  data  become  more  sparsely  distributed  as  a  result  of  variable-resolution 
processing,  the  required  density  of  sampling  points  decreases.  In  theory,  the  sampling  rate 
appropriate  for  a  given  variable-resolution  image  is  determined  by  the  local  bandwidth  as 
discussed  above.  In  practice,  we  do  not  have  an  exact  measure  of  the  local  bandwidth,  therefore 
we  have  chosen  to  use  the  distortion  function  to  obtain  the  appropriate  sampling  rate  for  our 
variable-resolution  images.  As  noted  by  Zeevi  &  Shlomot  (1993),  the  local  bandwidth  is  related 
to  the  derivative  of  the  distortion  function.  According  to  the  sampling  theorem  for  images  which 
belong  to  the  space  of  LBLFs  (Zeeva  &  Shlomot,  1993),  the  set  of  nonuniformly  distributed 
sampling  points  can  be  obtained  by  applying  the  inverse  of  the  distortion  function  to  the  uniform 
grid.  This  has  been  done  in  one  dimension  using  as  a  distortion  function  one  of  the  functions  of 
Figure  3  that  was  found  to  be  at  or  near  threshold  for  all  of  the  observers  tested  in  Experiment  1. 
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Shown  on  the  left  in  Figure  11  is  the  chosen  distortion  fiuiction.  Shown  at  the  top  right  in  the 
same  figure  is  a  set  of  uniformly-spaced  sampling  points  (i.e.,  pixels)  corresponding  to  one-half 
(from  center  to  edge)  of  a  cross-cut  through  an  image.  Shown  at  the  bottom  right  in  Figure  1 1  is 
the  nonuniform  set  of  sampling  points  that  corresponds  to  the  chosen  distortion  function.  In  this 
case,  the  operation  results  in  a  reduction  by  a  factor  of  about  two  in  the  number  of  points  required 
for  the  representation.  This  would  correspond  to  a  reduction  of  about  four  when  the  full 
two-dimensional  image  is  represented  at  the  indicated  sampling  rate.  It  should  be  noted  that  this 
is  a  conservative  estimate  of  the  savings  that  might  be  expected  fi'om  variable-resolution 
processing  of  static  imagery  in  that  the  psychophysical  technique  used  here  for  assessing  the 
perceptual  equivalence  of  the  processed  and  unprocessed  images  was  designed  to  detect  very 
small  differences  in  the  appearance  of  the  images.  Much  larger  differences  in  the  images  might  be 
tolerated  if  the  perceptual  criterion  was  recognition  or  identification,  and  the  associated  sampling 
efficiency  in  that  case  would  be  significantly  greater. 
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Figure  1 1 

Nonuniform  Sampling  According  to  a  Measured  Distortion  Function 
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APPENDIX  A 


The  program,  vresfb.c,  used  to  generate  variable-resolution  imagery. 
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APPENDIX  B 

The  programs  used  in  the  experimental  portion  of  Experiments  1  and  2.  Programs  opaste.c, 
slice.c,  and  vcirc.c  were  used  to  generate  the  experimental  stimuli  from  the  unprocessed  original 
images  and  the  associated  variable-resolution  images.  Program  scale. c  was  used  to  apply  the 
luminance  gamma-correction  as  well  as  the  correction  for  luntinance  variations  across  the  imrtg'* 
caused  by  projector  optics  and  the  directional  properties  of  the  rear-projection  screen.  Progiar' 
segs.c  was  used  to  present  the  experimental  stimuli  and  collect  the  observers’  icsHonses  i) 
Experiment  2,  and  was  similar  to  the  program  used  for  this  purpose  in  Experiment  1  *"inalh 
program  asegs.c  was  used  to  analyze  the  observers'  responses. 
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^^'i#'>>f'>*^-«^'«f>X’<«f>r'<r'^<<>^'Nr'<#N^^sr>ftA*/^i/>iOtf^u>i/>inu^u^y^intnt/>i/>ini/>i/>iniAinir>4nirii/>in4nioininirikAininininknin«nfcA 
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540:  return(( int) value); 
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Created  by:  Craig  A.  Vrana  -  UDRI 

Date  Created:  12-JAN-90 
Last  Modified:  15-HAr-92 

This  program  was  wripen  to  analyze  the  raw  data  files  produced  by 
the  variable-resolution  study  program.  The  format  of  the  raw  data 
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format  character  in  two  fprintf  statements. 
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APPENDIX  C 


The  program,  mtf.c,  used  to  calculate  the  MTFs  of  the  wide-field  display  used  in  the  present 
study.  The  program  was  written  to  implement  the  procedures  for  calculating  the  MTF  described 
by  Kelly  (1992). 
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meas  data [128] 
f i lt~data(500i 
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fptr  *  fopen(name2,"r''); 
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